A Simple Explanation of A Spectral Algorithm for Learning Hidden Markov Models

نویسنده

  • Matthew J. Johnson
چکیده

This document is my summary explanation of the algorithm in “A Spectral Algorithm for Learning Hidden Markov Models” (COLT 2009), though there may be some slight notational inconsistencies with the original paper. The exposition and the math here are quite different, so if you don’t like this explanation, try the original paper! The idea is to maintain output predictions in a recursive inference algorithm, instead of the usual method of maintaining hidden state predictions, and to represent the HMM only in terms of the maps necessary to update output predictions given new data. This approach limits the inference computations the algorithm can perform (it can’t answer any queries about the hidden states since it doesn’t explicitly deal with them at all), but it also reduces the complexity of the model parameters that are learned and thus makes learning easier. The learning algorithm uses an SVD and matrix operations, so it avoids the local-optima problems of EM or any other algorithms based on maximizing data likelihood over the usual HMM parameterization. The COLT paper includes error bounds and analysis. Notation. For a vector v in a subspace V ⊆ R and a matrix C with linearly independent columns and range(C) ⊇ V I will use [v] to denote the coordinate vector of v relative to the ordered basis given by the columns of C, and [v] or simply v to denote the coordinate vector of v relative to the standard basis of R. Similarly, for a linear map A : V → V I will use [A]C to denote the matrix of A relative to domain and codomain bases given by the columns of C, and [A] to indicate its matrix relative to standard bases. For a matrix A I will also use [A]ij to denote the (i, j)th entry.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Taylor Expansion for the Entropy Rate of Hidden Markov Chains

We study the entropy rate of a hidden Markov process, defined by observing the output of a symmetric channel whose input is a first order Markov process. Although this definition is very simple, obtaining the exact amount of entropy rate in calculation is an open problem. We introduce some probability matrices based on Markov chain's and channel's parameters. Then, we try to obtain an estimate ...

متن کامل

An Adaptive Approach to Increase Accuracy of Forward Algorithm for Solving Evaluation Problems on Unstable Statistical Data Set

Nowadays, Hidden Markov models are extensively utilized for modeling stochastic processes. These models help researchers establish and implement the desired theoretical foundations using Markov algorithms such as Forward one. however, Using Stability hypothesis and the mean statistic for determining the values of Markov functions on unstable statistical data set has led to a significant reducti...

متن کامل

Spectral learning of graph distributions

This work draws on previous works regarding spectral learning algorithm for structured data (see [5], [9], [2], [1], [8]). We present an extension of the Hidden Markov Models, called Graphical Weighted Models (GWM), whose purpose is to model distributions over labeled graphs. We describe the spectral algorithm for GWM, which generalizes the previous spectral algorithms for sequences and trees. ...

متن کامل

A Spectral Algorithm for Learning Hidden Markov Models

Hidden Markov Models (HMMs) are one of the most fundamental and widely used statistical tools for modeling discrete time series. Typically, they are learned using search heuristics (such as the Baum-Welch / EM algorithm), which suffer from the usual local optima issues. While in general these models are known to be hard to learn with samples from the underlying distribution, we provide the firs...

متن کامل

مدل سازی فضایی-زمانی وقوع و مقدار بارش زمستانه در گستره ایران با استفاده از مدل مارکف پنهان

Multi site modeling of rainfall is one of the most important issues in environmental sciences especially in watershed management. For this purpose, different statistical models have been developed which involve spatial approaches in simulation and modeling of daily rainfall values. The hidden Markov is one of the multi-site daily rainfall models which in addition to simulation of daily rainfall...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1204.2477  شماره 

صفحات  -

تاریخ انتشار 2012